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METHODS OF DETECT1K6 A GEKSTXC 
PXtEDISK>SITION FOR OSTEOARXHRXTXS 

ACKNOWI^DGEHENT OF GOVERKMENT RX6HTS 

Research for this invention was supported in part by 
5 the National Institutes of Health Grants AR-381 88 and 
AR-39740. The government may have certain rights in the 
invention. 

FXEZ.D OF XHVEKTXON 

The present invention relates to the field of 
10 compounds and methods for detecting genetic diseases linked to 
anomalies of genes for collagens and other structural proteins 
found in cartilage and joints. 

BACKGROUND OF tTHE XNVENTXON 

Osteoarthritis is a progressive disease of joints that 

15 is a cause of serious disability in large numbers of people. 
The disease is defined as a progressive degeneration of the 
cartilaginous surfaces of joints that leads to stiffness, pain, 
and loss of mobility. Degeneration of the cartilaginous 
surface of joints seen in osteoarthritis can have a number of 

20 causes. For example, severe trauma to a joint or a bacterial 
infection in a joint can produce degeneration of the joint that 
is either immediate or slowly progressive over many years. A 
number of metabolic disturbances are also )cnov to produce 
degeneration of joints. 

25 Cartilage and membranes that line joints are complex 

structures. A major source of the strength of cartilage is the 
fibrils of type II collagen. The fibrils of type II collagen 
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are stretched into three-dimensional arcades primarily by the 
presence of another group of macromolecules called 
proteoglycans. Proteoglycans are highly charged and, 

therefore, absorb water and salts and thereby extend the 
5 arcades of type II collagen fibrils. As a result, a highly 
resilient structure is formed that can withstand the 
intermittent pounding and pressures that joints must undergo • 
In addition to proteoglycans and type II collagen, cartilage is 
known to contain at least four other kinds of collagens (types 

10 VI, IX, X and XI) in lesser amounts than type II collagen. It 
is very likely that additional collagens will be discovered in 
cartilage in the future. In addition, it is clear that the 
matrix of cartilage also contains a number of other proteins 
that are still poorly characterized and that may contribute to 

15 the structure and function of the tissue. 

Collagens, proteoglycans and other proteins found in 
the matrix of cartilage are synthesized by cells embedded 
within the matrix. The matrix is actively synthesized during 
embryonic development of certain tissues and during periods of 

20 growth. The rates of synthesis and degradation of the matrix 
are less during adult life. However, throughout life, a 
continual slow synthesis and degradation of cartilage occurs, 
particularly in response to the pressures associated with 
physical activity. 

25 cartilage itself has several different functions in 

the body. E)uring embryonic development, transient tracks of 
type II collagen and probably other components of cartilage are 
formed in many structures. The tracks appear to serve as a 
guide for cell migration and a template for formatting of 

30 skeleton and associated structures. in addition, cartilage 
serves as a precursor structure for many bones. During the 
development of long bones such as those of the arms and legs, 
cartilage is part of the growth plate in which cell growth 
occurs. More specifically, the cartilage grows away from the 

35 midpoint of the long bone and is continually degraded and 
gradually replaced by bone itself. An additional function of 
cartilage is to give shape and form to tissues such as the nose 
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and ears. Many of tiiB macromolecules found in cartilage are 
also present in the vitreous gel of the eye and account for the 
high viscosity of the vitreous. Still another major function 
of cartilage is to provide strength and resilience to 
5 structures such as the intervertebral disc of the spine. In 
joints^ it provides not only strength and resilience, but also 
the smooth surfaces for motion tinder heavy loads. 

The degeneration of joint cartilages that occurs in 
osteoarthritis is caused by a failure of the cartilage to 

10 maintain its structural integrity. In this process, the 
cartilage surface is eroded by physical pressures and is not 
adequately replaced by the new synthesis of cartilage- Instead 
of adequate repair of cartilage, secondary chcuiges occur in the 
joint surface and in the joint. These changes include, for 

15 example, inf lainmatory responses characterized by invasion of 
white cells and macrophages, abnormal deposition of mineral in 
the form of calcitm and phosphate within the joint space and in 
the cartilage Itself, deposition of fibers of type I and other 
collagens that are not normally part of cartilage or the joint, 

20 abnormal growth of cartilage cells and matrix at locations 
adjacent to the joint surface and abnormal calcifications of 
the joints and associated structures ♦ As part of the complex 
changes that occur, the cells of the cartilage or the invading 
cells from the blood stream begin to secrete degradative 

25 enzymes that further contribute to the degradation of the joint 
structures • 

In the more severe diseases of cartilage known as 
chondrodysplasias, serious defects in the formation of 
cartilage are apparent early in life and there is a failure of 

30 joints to develop their normal size and shape. There is also 
a secondary failure of bone growth seen in these diseases. 
Moreover, there can be a failure of normal development of many 
tissues such as failure to achieve closure of normal partitions 
between oral and nasal passages, known as cleft palate, and 

35 improper development of the vitreous gel of the eye that causes 
severe myopia and retinal detachment. 
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Research has demonstrated that some forms of 
osteoarthritis and related conditions are caused by mutations 
in the genes that code for and, therefore, determine the 
structure of the collagens that are the major source of the 
5 strength of cartilage. Mutations of collagen that have been 
defined include, for example, mutations in the gene for type II 
collagen and its precursor type II procollagen. These 
mutations are of two general kinds. One kind of mutation 
decreases the synthesis of type II procollagen. The second 
10 kind of mutation leads to the synthesis of a defective form of 
type II procollagen. As a result of these mutations, there is 
either a decrease in the normal level of type II collagen in 
cart:ilage and other tissues that contain the protein or the 
formation of abnormal type II collagen fibrils that do not have 
15 the strength of normal type II collagen fibrils and, therefore, 
cause the cartilage of joints to be degraded by normal wear and 
tear. The two kinds of mutations can also produce drastic 
effects during normal growth and development. As a result, 
some individuals who inherit some of the mutated genes develop 
20 severe chondrodysplasias and die In utero or shortly after 
birth. Alternatively, such individuals can have serious 
deformities such as dwarfism which shows severe malformation of 
joints and may be associated with conditions of severe myopia, 
myopia with retinal detachment and blindness, cataracts, cleft 
25 palate, and unusual facial appearance* Other similar mutations 
in the same genes may produce much milder effects and cause 
progressive generalized osteoarthritis in which affected 
individuals are apparently normal until middle age when they 
develop progressive stiffness, pain and then immobility of many 
30 joints. Mutations of the gene for type II procollagen and 
collagen have been shown to cause these disorders. Research 
suggests that some of the conditions are caused by similar 
mutations in other genes that code for other structural 
macromolecules found in cartilage which contribute to its 
35 normal resistance to wear and tear. Methods and compounds are 
therefore desired for the analysis and detection of mutations 
in genes, both for type II procollagen and for a series of 



wo 94/11532 PCr/US93/10964 

- 5 - 

known collagens that are components of cartilage (types VI, IX, 
X and XI) , as well as for still undiscovered collagens and 
other structural proteins that contribute to the normal 
strength and function of cartilage, 

5 SDHtfJURY OF THE IKV£ll*rXON 

The invention provides probes and primers 
complementary to certain regions of collagen genes • Also 
provided are methods using such primers and probes for 
detecting mutations in a collagen gene sequence isolated from 

10 cells from individuals suspected of exhibiting mutant collagen 
gene expression or containing mutant collagen genes. Also 
included are methods to detect mutations in the collagen genes 
of members of the individual's family. 

More specifically, the invention provides a method 

15 whereby an individual who has developed osteoarthritis or a 
related condition, or is suspected of developing osteoarthritis 
or a related condition, is tested to see if the individual has 
a mutation in the 0NA of a gene for a structural protein that 
is a normal component of cartilage. This method provides that 

20 the DNA sequence of a collagen gene is exeuained in an 
individual with osteoarthritis or a related condition. The DNA 
sequence is also compared with corresponding regions of a 
standard DNA from a series of individuals known not to have the 
disease in question. Any difference in the base sequence from 

25 the DNA of the individual tested as compared to the standard 
sequence is then evaluated in terms of whether or not it 
indicates an increased likelihood of the individual suffering 
from osteoarthritis or related disorder. For the first member 
of a family tested, all or a siibstantial portion of DNA coding 

30 for the gene is sequenced and compared to the standard 
sequence. 

The invention also provides that once the location of 
the gene mutation causing the disease is known / it can be 
looked for in members of the first individual's family. For 
35 each genetically predisposed individual fsuaily member, the 
mutation in the gene is expected to appear in the same position 
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in the collagen gene tested- A difference in base sequence of 
the DNA from the individual tested as compared with the 
standard sequence can be evaluated to determine how the 
mutation will effect expression of the gene. The potential for 
5 the difference in DNA secpience to produce the disease can also 
be evaluated in terms of whether it changes an amino acid 
sequence that is critical for the nonaal functioning of the 
protein. 

Methods further provide that DNA derived from the 

10 cells of the test sample is analyzed to determine whether or 
not the collagen gene contains a mutation. If a mutation is 
found in the gene, a rapid test can be devised for other 
members of the patient's family to determine whether or not 
they have the same mutation. 

15 The methods of the invention are particularly useful 

in detecting mutations in human collagen genes. It is believed 
that these methods will also be useful in detecting mutant 
collagen genes in mammals. 

Xntronic sequences provided by the invention are 

20 useful for developing oligonucleotide primers to amplify and 
sequence genomic DNA from patients suspected of having 
mutations in the gene for the proal (11) chain of type IX 
procollagen which cause various disorxiers. 

Methods of the present invention for detecting 

25 mutations in the gene for type II procollagen can readily be 
applied to detection of mutations in genes coding for other 
strxicturaX proteins found in cartilage and associated tissues. 
For example, the nucleotide sequences of these genes can be 
used to design oligonucleotide primers to amplify genomic DNA 

30 or cDNA for the gene. The products obtained using PGR can then 
be used to define the base sequences of genomic DNA or cDNA. 
Mutations in the genes for these other collagens and structural 
proteins in matrix that cause osteoarthritis and related 
conditions can be detected .in the seuae manner as mutations in 
35 the gene for type II procollagen. 
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BRIEF DESCRXPTZON OF THE DRAWINGS 

Figure 1 shows a gene map of type II procollagen 
(C0L2A1) with introns and exons designated by lines and 
crosshatched boxes respectively, 
5 Figxire 2 shows a schematic for PGR amplification of 

COL2A1 exon 9 including the region of the gene comprising 
polymorphism 225 and 252 and agarose gels containing the 
amplification products. 

Figure 3 shows a schematic for PGR amplification of 
10 COL2A1 exons 27 and 28 including the region of the gene 
comprising an intron 26 polymorphism and agarose gels 
containing the amplification products • 

DETAILED DESCRIPTION OF THE IKVENTXOH 

The present invention concerns methods of diagnosing 

15 specific kinds of osteoarthritis in which there is systemic 
degeneration of many joints without any apparent external 
cause. In particular, the invention relates to progressive 
generalized osteoarthritis that produces degeneration of the 
cartilage of many joints. It also concerns diseases in which 

20 there is malformation of joints that is apparent at birth or 
during childhood atnd that leads to progressive degeneration of 
joints^ as well as other symptoms such as dwarfism and severe 
malformation of the skeleton and other cartilaginous tissues. 
More particularly, the invention is directed to diagnosing 

25 diseases characterized by degeneration of joints in which the 
presence of the disease in several members of the same family 
or similarity between the disease of a given patient and 
diseases seen in families indicates that the disease has a 
genetic origin. Therefore, the invention concerns not only 

30 progressive generalized osteoarthritis, but also a group of 
related disorders that are generally defined as 
chondrodysplasias. skeletal dysplasia or related disease 
involving abnormalities of grox^ and of cartilaginous 
structures and of tissues containing the same structural 

35 proteins as cartilage including chrondrodysplasias, epiphyseal 
dysplasia, metaphyseal dysplasia, spondyloepiphyseal dysplasia, 
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spondylometaphyseal dysplasia and arthro-ophthalmopathy (the 
Wagner-SticJcler syndrome) may be diagnosed using the methods of 
the invention. Skeletal disorders such as scoliosis and 
related conditions involving abnormalities in the cartilage and 
5 other components of the vertebrae column and back may also be 
diagnosed in accordance with the invention. 

More specifically, the present invention provides 
methods for gene analysis whereby one can definitively 
establish the cause of osteoarthritis and of chondrodysplasias 
10 in individuals and in members of the individuals* feunilies. 
Such methods will make it possible to identify individuals in 
families who are predisposed to develop these diseases. In the 
case of severe chondrodysplasias that produce crippling 
deformities and that may even be lethal, such methods will be 
15 useful for prenatal diagnosis « In the case of milder 
chondrodysplasias and osteoarthritis, the results of the gene 
analyses may be used to counsel individuals predisposed to 
develop the diseases concerning preventive exercise programs, 
life styles, choice of careers, and family planning. Also, the 
20 information generated by the gene analyses using the methods of 
the present invention may be used to develop new rational 
therapies for diseases of collagen. Moreover, the information 
can be used to develop animal models for such diseases. For 
example, information may be useful to develop transgenic mice 
25 that will provide new means of testing new agents to cure and 
prevent the diseases. 

Methods of the invention provide that an individual 
who has developed osteoarthritis or a related condition or is 
suspected of developing osteoarthritis or a related condition, 
30 is tested to see a if mutation in the DNA of a gene for a 
structural protein for excunple, type II procollagen, is 
present. The methods of the invention further provide that 
after a mutation causing the disease or diseases in one 
individual is found, it can be sought in members of the first 
35 individual's faunily. 

The gene C0L2A1 which encodes the proal (II) chain of 
type II procollagen is provided and examined as a demonstration 
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of the methods of the present invention. This demonstration in 
no way limits the scope of the claimed invention • In the first 
stage of the methods of the invention, the DNA sequence of the 
C0Ij2A1 gene is examined in an individual with a cartilage and 
5 j oint disease such as osteoarthritis . This DNA sequence is 
compared with corresponding regions of a standard DNA from a 
series of individuals known not to have the disease in 
question • It is believed that this strategy will be useful for 
any disease of collagen which exhibits at least one mutation in 
10 a collagen gene of the diseased mammal. The DNA sequences of 
the genes tested can be genomic DNA or cDNA prepared from RNA 
derived from a sample of cells or tissues taken from the 
individual* DNA may also be extracted from bodily fluids 
containing lysed cells. The standard DNA sequence and 
15 structure of the C0L2A1 gene can be obtained by reference to 
known sequences or those set forth herein in Figure 1 and 
listed in the literature or computerized data banks* Sequences 
of other collagen genes used in the methods of the present 
invention loay also be obtained from datcdjases or may be 
20 sequenced employing commonly used methods. See, for example, 
Sanger et al . , DNA Sequencing With Chain-Terminating 
InhdLbitors, Proc. Natl. Acad. Sex. USA 1977, 74, 5463-5467. 
Any difference in the base sequence from the DNA of the 
individual tested as compared to the standard sequence is then 
25 evaluated to determine whether it indicates an increased 
likelihood of the individual suffering from osteoarthritis or 
a related disorder. For the first member of a family tested, 
all or a substantial portion of DNA coding for the gene is 
sequenced and compared to the standard sequence. Sequencing of 
30 the first family member's DNA may be achieved by DNA sequencing 
techniques known in the art* These methods of the present 
invention are useful for all collagen genes. 

As used herein, the term "collagen" includes 
procollagens and collagens .types I to XVI, still undiscovered 
3 5 collagens similar to types I to XVI , and genes encoding 
proteins associated or comprising collagen polymers in tissue 
matrices . 
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The term "family member", as used herein, means 
individuals, including humans and other mammals, genetically 
related to one another in any degree, such as, for example, 
parent-child, siblings, cousins, etc« Such genetic relatedness 
5 can be determined using standard methods known in the art 
including, for example, pedigree analysis or DNA 
"fingerprinting" . 

Moreover , the term " individual " , as used herein , 
denotes a mammalian individual of any species, including 
10 humans. 

As Illustrated by the demonstration using type II 
collagen, methods of detection of mutations in a collagen gene 
comprise several steps » One step involves selecting cells 
suspected of comprising a mutated collagen gene. Another step 

15 includes isolating genomic DNAs from selected cells or 
preparing cDNAs from selected cells. After the genomic DNA or 
cDNA is isolated, larger amounts of the gene sequences of 
interest are prepared by amplifying the DNA with the polymerase 
chain reaction (PCR) . The DNA produced by the PCR are then 

20 analyzed for the presence of a disease-causing mutation. The 
preferred strategy of analysis is to first screen the PCR 
products with a relatively rapid technique such as denaturing 
gradient gel electrophoresis (DGGE) that enables one to decide 
whether or not a specific PCR product from one region of the 

25 gene does or does not have a mutation such as a single base 
difference between the two alleles of the gene. PCR products 
detected as probably having a mutation by such a technique are 
then analyzed further by a technique such as dideoxynucleotide 
sequencing that provides the detailed base sequence that 

30 defines the mutation. The methods comprise the aforementioned 
steps and further comprise comparing the sequence of the 
collagen gene or other genes for structural proteins of 
cartilage containing the mutation to corresponding regions of 
a family member's structural protein genes and determining if 

35 the mutation is present in the fcuaily member's genes. 

It is believed that all of the methods of the present 
invention are useful to detect any mutation in all procollagen 
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and collagen genes ^ including, for example, procollagens and 
collagens I to XVI, still undiscovered procollagens and 
collagens similar to types I to XVI, as well as other genes 
associated with collagen structure, such as proteoglycans • It 
5 is also believed that these methods are also useful in maimnals 
other than humans* 

A second step of the invention provides that once the 
location of the mutation in a gene causing the disease is 
known, it can be sought in members of the first individual's 
10 family* For each genetically predisposed individual fcuaily 
member, the mutation in the gene is expected to appear in the 
same position in the structural protein gene tested. For 
example, in Family A, the genetic mutation may be at position 
30; and for Family B, the genetic mutation may be at position 
15 505. In accordance with the methods of the invention, testing 
the fcimily members can be done by comparing corresponding 
regions of family member's genes and determining if the 
mutation is present in the family member. This evaluation of 
a difference in base sec[uence of the DNA from the individual 
20 tested as compared with the standard sequence can be evaluated 
in terms of whether it is a disease-causing mutation by 
determining whether the mutation changes the level of 
expression of the gene in terms of the rate at which the gene 
is tremscribed into HNA, the rate at which the initial RNA is 
2 5 processed into mRNA and the rate at which the mRNA can be 
effectively used to synthesize mature collagen matrices, such 
as prooel (II) chains of type II procollagen. The potential for 
the difference in DNA sequence to produce the disease can also 
be evaluated in terms of whether it changes an amino acid 
30 sequence that is critical for the normal functioning of the 
protein by strategies used by those familiar in the art. The 
strategies include demonstration that the same gene mutation is 
not present in individuals unaffected by the disease in the 
same family of the general population, detailed linkage 
35 analysis of co- inheritance of the mutation with the disease 
phenotype in large families or a series of families with the 
same mutation, and structure- function studies on the mutated 
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protein obtained either by isolation of the protein from 
tissues of affected individuals or expression of the mutated 
gene in a recombinant system. 

In accordance with the methods of the invention, DNA 
5 is extracted from a test sample of cells of the family member 
to be tested by conventional techniques, such as lysis of the 
cells with sodium dodecyl sulfate (SDS) and digestion of 
protein with proteinase K, followed by extraction with phenol 
and chloroform, and ethanol precipitation as described by 

10 Maniatis et al., Molecalajr Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York, 1982, 
pp 280-281. A sample of cells can be taken from any type of 
tissues? for example, a piece of skin, a sample of blood, or by 
scraping of the interior of the mouth. Alternatively, mRNA can 

15 be extracted from the test sample and cDNA synthesized with 
reverse transcriptase, and the resulting cDNA used for 
cuialysis. Although cartilage cells cannot regularly be 
obtained from patients, many other cells including wh.ite blood 
cells have been shown to contain small amounts of the mRNAs for 

20 protein synthesized by cartilage and the mRNAs can be analyzed 
after conversion to cDNAs as reported by Chan and Cole, Journal 
of Biological Chemistry X991, 266, 12487. 

Following extraction, DNA derived from the cells of 
the test sample is analyzed to determine whether the structural 

25 protein gene contains a mutation. If a mutation is found in 
the gene, a rapid test can be devised for other members of the 
patient's family to determine whether they have the sajae 
mutation. DNA and cDNA from other structural protein genes can 
also be used in this method of the invention. 

30 Although the methods of the invention have been 

demonstrated in the first instance in human beings, it is 
expected that they will be useful in other mammalian species, 
particularly commercially important species and in laboratory 
animals used as models of hxman disease. For example, it is 

35 believed that these methods will be particularly useful in 
detecting mutant collagen genes and transcripts in transgenic 
animals comprising mutant collagen genes or transcripts. 
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Some of the sequences for the normal COL2A1 gene can 
be found in the literature. For example, in Strom and Upholt, 
Nucl. Acids J?es. 1984, 12, 1025-1038; Cheah et al,, Proc. Natl. 
Acad Sci. USA 1985, 87, 2555-2559; Sangiorgi et al., Nucl. 
5 Acids R&s. 1985, 13, 2207-2225; Nunez et al.. Gene 1986, 44, 
11-16; Su et al,, GBZiomlcs 1989, 4, 483-441; Vikkula and 
Peltonen, FEES Lett. 1989, 250, 171-174; Upholt, Collagen Vol. 
4, CRC Press, Baton Rouge, FL, 1989, pp 31-49; Ala-Kokko and 
Prockop, Genomics 1990, 8, 454-460. However, the sequences for 
10 a major part of the introns used for the synthetic 
oligonucleotide primers employed for the analyses described 
here are provided as part of the present invention. Those 
skilled in the art recognize that rapid analysis of a gene by 
these procedures requires a series of oligonucleotides that are 
15 specifically targeted to regions of the gene under analysis. 
However, many base sequences are not efficient targets for 
oligonucleotide primers. In the case of procollagen and 
collagen genes, the number of efficient target sites is limited 
by the high GC content and repetitive nature of the coding 
20 sequences. Also, most of the genes contain a large number of 
introns, and many of the introns are too short to offer many 
potential target sites (see Figure 1) . Therefore, tbe present 
invention includes efficient ol igonucleotide primers for 
amplification of the type II procollagen gene by PCR, efficient 
25 oligonucleotide primers for analysis of PGR products of the 
type II procollagen gene by DGGE, and efficient oligonucleotide 
primers for dideoxynucleotide sequencing of PCR products of the 
type II procollagen. Table I presents specific oligonucleotide 
primers for amplification of the human type II procollagen gene 
30 by PGR and for dideoxynucleotide sequencing of the PGR 
products. Table II presents a series of neutral sequence 
variants detected in the type II procollagen gene that are 
important to avoid as target sites for primers, for defining 
haplotypes of the gene, and evaluating putative disease-causing 
35 mutations in the gene. Table III presents oligonucleotide 
primers that are efficient for amplifying regions of the gene 
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by PGR in a form that makes the PGR products suitable for 
screening for mutations by the technique of DGGE- 

The analytical methods that are part of the invention 
involve the PGR. One skilled in the art would recognize that 
5 there are many commonly employed schemes for amplifying nucleic 
acid sequences using PGR. See, for example, PCR Protocols,. A 
Guide to Methods and Applications, Innis et al, Eds., Academic 
Press, New York, 1990, and Ciurreixt Communications,. PolymejrasB 
Chain RBaction, Ehrlich et al* , Eds* , Gold Spring Harbor 
10 Laboratory Press, Cold Spring Harbor, NY, 1989. The PGR 
methods of the invention for amplifying nucleic containing at 
least one mutation in a collagen gene comprise several steps* 
The steps enumerated herein are merely exemplary of the steps 
commonly employed by one skilled in the art performing PGR. 
15 Other steps may be added to achieve DNA amplification. 
Included in the invention are steps for selecting cells 
suspected of comprising a mutated collagen gene and isolating 
nucleic acid from the cells. Initial PGR steps comprise 
contacting the nucleic acid with a first primer and a second 
20 primer, extending the first primer to create an extension 
product, contacting the extension product with a second primer, 
and extending the second primer to create another extension 
product. The extension products of the first round of 
synthesis may be longer than the extension products of the 
25 second round of synthesis. Extension products of the first 
round of S3^thesis are contacted with the primers comprising 
complementary sequences. The primers used in this step may be 
the same primers used in the first round of synthesis* These 
primers are extended to form "secondairy" extension products* 
30 These "secondary** extension products may be shorter than the 
initial extension products from which they were synthesized* 
Amplification steps comprise amplifying the "secondary** 
extension products using PCR* The initial runoff DNA extension 
products will be diluted during amplification. The 
35 amplification steps will favor synthesis of the "secondary" 
extension product DNAs having a length determined by the probes 
used. Once the PGR steps are completed, steps are provided for 
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delrecting the presence or absence of said mutatiion in at least 
one extension product • It is believed that these methods will 
be useful for analyzing cartilage matrix protein genes, 
especially in hxamans. It is also believed that these methods 
5 are useful to analyze the matrix protein genes of other 
proteins found in collagen matrices, such as non-collagenous 
stinictural protein of cartilage- Further, the PGR methods of 
the invention are useful for deteirmining if a mammal has a 
genetic predisposition for a disease exhibiting a mutant 
10 collagen gene. These methods include further steps of 
comparing the sequence of the collagen gene containing the 
mutation to corresponding regions of a family member's collagen 
genes and determining if the mutation is present in the family 
member's collagen genes. It is believed that these methods 
15 will also foe useful in other mammals. 

Methods of the present invention for detecting 
mutations in the gene for type II procollagen can readily be 
applied to detection of mutations in genes coding for other 
structural proteins found in cartilage and associated tissues. 
20 For example,, the nucleotide sequences of the genes for types 
VI, IX, X and XI collagens can be used to design 
oligonucleotide primers to amplify genomic DNA or cONA for the 
gene using pgr. The products obtained using PGR can then be 
used to define the base sequences of genomic DNA or cDNA* 
25 Therefore, mutations in the genes for these other collagens and 
structural proteins in matrix that cause osteoarthritis and 
related conditions can be detected in the same manner as 
mutations in the gene for type II procollagen . 

The following examples are illustrative of the 
30 invention. It is understood that this invention is not limited 
by these illustrative examples but solely by the claims 
appended hereto. 

EZAHPLES 
Example 1 

35 A series of procedures were developed for amplifying 

important regions of the gene for type II procollagen {C0L2A1) 
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by the PCR so that adequate amounts of DNA were generated for 
analysis. A further series of procedures made it possible to 
directly analyze the DNA produced by the PCR and define its 
base sequences • 

5 Figure 1 presents a diagram of the human COL2A1 gene 

indicating the location of the 54 exons of the gene. The 
figure also indicates^ using short horizontal lines ^ the 
regions of the genes that were amplified with the use of 
appropriate oligonucleotide primers and conditioned for 
10 amplification of the gene by PCR. Table I presents the 
sequencing and location of the oligonucleotide primers defined 
by the specific base sequences of the gene beginning with 
number 1 at the 5 '-end of the gene. Ala-Kokko and Prockop, 
Genomxcs X990, 8, 454-460, disclosed all the coding sequences 
15 of the C0L2A1 gene but no more than 40 intronic bases 
iimediately flanking most of the introns* 

Excuaples of primers effective for amplification of 
sequences by PCR and sequencing of the PCR products presented 
in Table I are within regions of the introns which have not 
20 been disclosed previously. These sequences and primers are 
included in the present invention* The steps whereby genomic 
DNA from an individual from sources such as white blood cells 
or any other cells in the body can be used in the procedure to 
detect variations in sequences are as follows. The DNA 
25 template is amplified using PCR. The PCR products can be 
loaded directly on an agarose gel and electrophoresed and 
analyzed or they can be diluted and subjected to a second round 
of PCR amplification, PCR products obtained from the first PCR 
reaction in which the regions of the gene symmetrically 
30 amplified were examined by electrophoresis in an agarose gel 
stained with ethidium bromide. The conditions of the 
experiment were adequate to generate an intense single band of 
DNA, an observation indicating that the target region was 
selectively amplified. If a second PCR amplification is 
35 carried out, the products are purified and sequenced. The 
second PCR in which the product of the first PCR is 
asymmetrically amplified generates a single- stranded DNA which 
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can be directly used for sequencing. The results indicate the 
presence of two major bands of DNA, one double-stranded DNA and 
the second single-stranded DNA appropriate for sequencing- 
Analysis of the base sequences of PGR products from the second 
5 asymmetric PGR were analyzed by dideoxynucleotide sequencing 
using methods known in the art* The sequencing autoradiograms 
. were of high enough quality to be edble to detect single-base 
. variations in which one allele of the gene has one base and the 
same position of the second allele from the same gene has a 
10 different base. For example, patients 1 and 5 had an A in one 
allele and a T in the other allele at position +5 of intron 9 
of the COL2A1 gene, whereas patients 2, 3, and 4 had only a T 
in this position. Moreover, patient 4 had an A and a G in 
position +45 of intron 9, whereas the other patients had only 
15 a G. similarly, DMA from one patient showed both a C and a T 
at position -47 of intron 26 of the C0L2A1 gene, whereas a 
different patient had a C at this position emd the patient in 
the right-hand four lanes had a T in this position. These 
results demonstrate that the procedures outlined here are 
20 adequate to detect single-base variations in a single allele. 
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(i) SEQUENCE CHARACTERISTXCS : 

(A) LENGTH: 19 

(B) TYPES NUCLEIC ACID 

(C) strandednessj single 

(D) TOPOLOGY: LINEAR 
(iv) ANTI-SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GACTCCAGGC TACCACGAA (19) 
( 2 ) INFORMATION FOR SEQ ID NO: 94s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
CATGGAGGAG TGATATTC (18) 
( 2 ) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 

(B) TYPE: NUCLEIC ACID 

( C ) STRANDEDNESS : SINGLE 
(D) TOPOLOGY: LINEAR 
(XV) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
CTGCTGTGGA GAATTGTTC (19) 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i.) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 18 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
TGCTGTGGAG AATTGTTC (18) 
(2) INFORMATION FOR SEQ ID NOs 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDltESS : SINGLE 

(D) TOPOLOGY: LINEAR 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
CAATGCGGGC TGCCTCCTTG (20) 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
AATGCGGGCT GCCTCCTT (18) 
(2) INFORMATION FOR SEQ ID NO: 99: 
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CIiAIMS 

What: is claimed is: 

1. A method of determining a genetic predisposition 
for a disease caused by a mutation in the DNA of a gene 

5 encoding a structural protein that is a normal component of 
cartilage comprising: 

selecting cells from an individual suspected of having 
a mutated structural protein gene; 

isolating nucleic acid from the cells; 
10 amplifying regions of the isolated nucleic acid that 

contain the DNA of a specific structural protein gene using 
specifically targeted oligonucleotide primers; 

analyzing the DNA of the structural protein gene for 
the presence of a mutation using the primers; 
15 detecting the presence of the mutation in at least one 

structural protein gene; 

comparing the sequence of the gene containing the 
mutation to corresponding regions of a family member 's collagen 
genes ; and 

20 determining if the mutation is present in the family 

member ' s genes . 

2. The method of claim 1 wherein the DNA analysis 
step comprises sequencing PGR products using the primers, 

3 . The method of claim 1 wherein the DNA analysis 
25 step comprises analyzing PGR products by denaturing gradient 

gel electrophoresis » 

4. The method of claim 1 wherein the amplification 
step comprises: 

selecting cells suspected of having a mutated 
30 structural protein gene; 

isolating nucleic acid from the cells; 

contacting the nucleic acid with at least two primers; 

synthesizing primer extension products from the 

primers ; 
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amplifying the extension products using polymerase 
chain reaction; and 

detecting the presence of a mutation in at least one 

extension product. 

5 5. The method of claim 1 wherein the structural 

protein gene is a collagen gene. 

6. The method of claim 1 wherein the structural 
protein gene is a C0L2A1 gene. 

7. The method of claim 5 wherein the gene encodes 
10 a non-collagenous structural protein of cartilage, 

8. The method of claim 2 wherein at least one of the 
primers comprises a sequence identified in Table I. 

9. The method of claim 3 wherein at least one of the 
primers comprises a sequence identified in Table III. 

15 10. The method of claim 1 wherein the disease is 

osteoarthritis . 

11. The method of claim 1 wherein the disease is a 
skeletal dysplasia or related disease involving abnormalities 
of growth and of cartilaginous structures and of tissues 

20 containing the same structural proteins as cartilage, selected 
from the group consisting of chondrodysplasias, epiphyseal 
dysplasia, metaphyseal dysplasia, spondyloepiphyseal dysplasia, 
spondylometaphyseal dysplasia and arthro-ophthalmopathy . 

12. The method of claim 1 wherein the disease is a 
25 skeletal disorder involving abnormalities in the cartilage and 

other components of the vertebral column and back. 



13. The method of claim 12 wherein the disease is 
scoliosis . 
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14 . A primer comprising a sequence complementary to 
an intronic sequence from a collagen gene. 

15 • A primer of claim 14 wherein the intronic 
sequence is an intronic sequence from a mammalian procollagen 
5 gene . 

16. A primer of claim 15 wherein the procollagen gene 
is a C0L2A1 gene. 

17 . A primer for use in detecting mutations in a 
mammalian gene for a structural protein of cartilage comprising 

10 a sequence identified in Table III. 

18. A primer for use in detecting mutations in a 
mammalian gene for a structural protein of cartilage comprising 
a sequence identified in Table I. 
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216 22S 252 
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330 bp 
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Excess 
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225 



252 



...TGAGTATACACGAATAGACAAATGAGGAGCTGCCTCCrTTQAAAGGGCCTGQ. 



Acc I restriction site: 
GljAOAC 

A/G 6/G A/A 



214bp-4 



116bp-l 
106 bpi 



DislrttHition of Polymorptiism 225 


A/G 


GIG 


A/A 


0.58 


0.18 


0.24 



dG 

Dra i restriction site: 
TT^AAA 



dG/dG G/dG G/G 




— 330 bp 

- 252 bp 



78 bp 



I^ribution of Polymorphism 252 


dG/dG 


G/dG 


G/6 


0.04 


0.34 


0.62 
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